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ABSTRACT 



A mathematical definition of an information network is constructed with 
the purpose of developing a theory useful in answering practical questions 
concerning information transfer. An information network includes: (1) use^s, 

(2) information resources, (3) information centers * and (4) the total infor- 
mation transfer structure linking (1), (2), and (3)* Emphasis is placed on 
the message transfer structure, as distinguished from the documferit transfer 
strueture s to identify some basic network configurations. 

Any message transfer structure is shown in graph theory concepts to 
be either isographic or nonisographic* Among the iaographic structures, 
the cyclic and decentralized networks are defined. The strictly hierarchical 
network is also defined, and the 2-regular network, reflected in the ARPA 
design, is identified. 

Measures of network structure, in particular the accessibility and 
flexibility in message transfer, are developed. These measures for the 
basic structures are used to characterize more general structures* While 
some comparisons of message transfer structures can be made, development 
of more comprehensive measures is a necessity. 



I . INTRODUCTION 



Contribution of mathematical models 

The concept of information network evokes markedly different responses 
in this period of information deluge and computer panaceas. While librarians 
have sought for years to understand, organize and control this essential 
commodity — information, a latent uneasiness, perhaps even distrust, surfaces 
at the mention of "information network". Library networks seemS a more 
comfortable term, divorced from the esoteric hardware Structures implied by 
"communication networks", "computer networks", and "information networks". 

Some maintain that information must be transferred by electronic means 
in order for an information network to exist. R. C. Swank [1] comments on 
the conditions imposed by several definitions of "information network" and 
offers his own definition in terms of the characteristics: (1) information 

resources, (2) users, (3) intellectual organization of documents or data, 

(4) methods for delivery of resources, (5) formal organization and (6> bi- 
directional communications networks. While definition in terms of these 
characteristics provides a basis for discussion, little can be said defini- 
tively about networks in general, the relation of one network’s characteristics 
to another, and the similarities among networks. 

lie offer a mathematical definition of information network and cdmpare 
our definition with Swank's. Out definition hds the disadvantage, at least 
viewed by some, of a higher level of abstraction. We accept the abstraction 
in order to gain rigor that leads to definitive statements about network 
structures and differerices and similarities dmong networks. 
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Information netwotk concepts 



We do not attempt to cite the extensive body of literature on infor- 
mation networks. An excellent bibliography can be found In the paper by 
Samuelson [2]. Recent attempts to organize the concept of library or informa- 
tion networks generally have followed one of two approaches* The first 
approach is to define a network in terms of its functional organization. 

Swank II] provides an excellent description using this approach. The 
second approach is to dwell on the structure for information transfer and 
be less concerned with the functions served by this transfer. Nance [3j 
offers an example of this approach. 

A few anchors have followed both approaches to some degree* Duggan 
[4* pp. 159-160] describes the "twelve critical components ** ^essential to 
orderly , planned development of the objectives (of a library network) 11 . 

She then cites several network configurations ; thus recognizing the different 
possible structures by which information transfer can be accomplished# 

Duggan suggests six structural forms, and we show these in Figure 1, She 
also presents the number of "channel links" (C) required by each structure. 

Davis [5] uses botk approaches in her description of the National 
Biomedical Gorranunications Network and a design procedure for networks in 
general. With respect to the structure for information transfer, she 
identifies four types of network organization, which are shovri in Figure 2. 

The centralized and decentralized structures identified by Davitt correspond 
to the directed and non-directed forms of Duggan; moreover, the composite 
centralized structure is analogous to Duggan* s representation of the inter*- 1 
face of two directed networks# 

The fourth structure offered by Davis, the hierarchical network, is of 
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(a) Interface of Two Non-Directed Networks 




C » N- I e 6 



(b) Directed Network 





(e) Non-Directed Network Including Specialized 

Center 



(d) Non-Directed Network 





(e) Interface of Two Directed Networks 
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(£) Directed-Network Including Specialized 

Center 



Figure 1. Network Classifications Offered by Duggan [4] 
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(a) Totally Decentralized Network 



(b) Totally Centralized Network 
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considerable interest. She notes that while the control system complexity 
increases with the hierarchical stureture, more flexibility in intercommunica- 
tions and greater reliability of the network are gained [5, p. 35], 

In a different context, Kleinrock [6] has used different terminology 
to identify the identical structures In communication networks. The star- 
net configuration [6, p. 28] corresponds to the centralized structure shown 
by Davis, and the fully connectured net [6, p, 101], to Davis’ decentralized 
configuration . 

Duggan, Davis and Kleinrock have used the theory of directed graphs, 
although not explicitly, to enable them to identify different structures. 

Nance [3] explicitly employs a graph-theoretic approach to model a library 
network. No one has sought to examine the relationships among the structures 
in a comprehensive and definitive manner. 

To answer questions concerning which structure is best for accomplishing 
certain objectives, we must first be capable of defining the structures. 

Our effort is to develop h unifying concept of the term "information network", 
and from this concept to construct a methodology by which structures for 
information transfer can be evaluated and compared. 
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II. A UNIFYING CONCEPT OF NETWORK STRUCTURES 



We define an information network as a set 
N = {U,I,C,A,£, f'} 



where 

U is the set of users (either potential or actual) of information 
resources (i£j); 

I is the set of information resources that are accessed by users (ue£/) ; 

C is the set of information centers and with each c£( 7 is associated 
a subset Us£/ and a subset Is! , either one of which, but not both, 
may be empty; and 

A is the set of ail edges on UvIvC 9 where an edge from node to 
v denotes that is directly accessible from v^, and each edge 
(Vi, v 2 )e^|c' (the edges of C) carries one or both of the labels: 

m indicating that message transfer, or 
b indicating that document transfer 

can be accomplished from to The distinction between message 

and document transfer is made in subsequent paragraphs. 

Four subgraphs of interest are: 

1. Ny = <UvC, arcs between U and C> and 

Vu £ U there exists at least one edge (u,e) for ceC, 

2. Nj = <jvC, arcs between J and C> and 

Vi £ I there exists at least one edge (i ,c) for ceC , 

3- G = <C*, arcs with label m joining nodes in C> , and 
4. G’ ^ <0 5 arcs with label b joining nodes in €> m 

Two conditions that are imposed on the relationships among £/, I and Cl 

1* the subgraph <U* J, a| ^ j> is totally disconnected, 1 . e- , no direct 
access is permitted between users, information resources, or users 
and information resources; and 

2* the subgraphs of Q and G’ are strongly connected. 

The functions f and f f constitute the Information transfer structure 
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(i.t.s.) for the network. To define these functions we use 



~ {all open paths in 



G> and I’ b (C) - {all open paths in G'}1 Foj 



an individual path PeP (C) or PeP (C'i u/n'i / j . 

m ^ let ^vP) — {nodes ceC’fceP}, Then 



m 



2 UXI by 



and 



1 ’ f(P) h =l; e5t±StS n ° 6dSe CUsC) Wlth l3bel m > c eV(P), then 
2, otherwise, f (P) = x f or 

I - {iejJ there exists an edge (i, c ) wlth label ceV(p)K 



f ’ 5 



2 IxU by 



1- “ ( p)"%r lsts n ° ° d6<! Cu>c) “ ith iabei ». <=«<*>. th„ 

2. otherwise , f'(P) « x f or 

I - {ie/| chare exists an edge <i,c) with label b, c£V(P)J. 

Nete th.t the condition of strong connectivity placed on G and G' assnras 
that . user in contact with any information center has access, through mas- 
sage and document transfer, to any other center. 

One might characterize U and I i„ various ways, but we believe that 
this definition of information network enables a mathematical description 
that capture, the intent of the defining characteristics of Swank [1J. 

The sec of users U is considered finite but not necessarily static. 
Identification of a set member net! is accomplished partly by noting the 

ln£ ° ra, “° n SSS ^ Utilized by “ b ° P»“* “ Particular re,„est for information. 
An information center is defined as an entity where a request for information 

may he made by users <0£ « and/or information resources (I «J> are located. 
Thu, our description of the user necessarily Involves the one or more infer- 



~ 7 — 

An °P en P at h is an alternating sequence of di^rlnr-i- , , 

that each directed edge is adiacenr dlS “ nct nodes a « d ^dges such 

node following it. adjacent from the node preceding it and to the 
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mation centers through which he accesses the information network. 

Information resources (X) are considered representable by a set of 
discrete entities, each having a unique identifier. Individual members 
iel are documents with certain intrinsic labels, including identification, 
characteristics, information centers where copies of the document are stored, 
etc. In addition, extrinsic labels including user identification u, user 
information center £, etc., are attached to a copy of the document indicating 

a transfer of that copy of the document to center £ in response to a request 
by user u. 

Within an information network two types of information transfer take 
place [3, p. 59]: 

(1) message transfer (accomplished by f) — information necessary to 
gain access to the information resources (X) , and 

(2) document transfer (accomplished by f')— supply of the information 
resources (X) to members of V , the set of users. 

One can view the first transfer as involving unlab el ed information, i,e, . 
no final destination is prescribed for the transfer; while the latter con- 
cerns labeled information, i . e ■ , associated with the transfer of each item 
of information ieX is a final destination, or receiver, u e£/. According 
to our definition f is the structure imposed to enable access , the message 
£ ? _ a . n . sfer s t r ucture , and f' is the structure created to effect delivery or 
response , the document transfer structure* 
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III. STRUCTURAL CLASSIFICATION OF INFORMATION NETWORKS 

The graph G is the characteristic used by other authors to classify 
information networks. For example, Duggan's [4J directed network (see 
Figure 1(b)) prescribes a specific form of G. By considering the form of 
G, we can define each specific message transfer structure suggested by 
Duggan [4 j and Davis [5J. Additionally we can characterize structures 
that are not classified immediately as belonging to any structure class, 
e - g » i that shown in Figure 3, 

We state the following definitions for information networks, in all 
cases assuming N - j > i- 

1. An information network N wlth N centers is cyclic if and only if 

idfv^) = od (v^ ) =1 V v^BC , 

2. An information network N with N centers is dece ntralized if 

and only if — 

id(v, ) - od(v.) = N-l V v eC, 2 

x X 1 

The cyclic and decentralized networks define opposite extremes in the degree 
of connectivity among strongly connected digraphs. Both of these networks 
nave message transfer structures (G) which are specific examples of an iso- 
graph [8, p. 330 J. We can refer to the cylcie network as a 1-regular iso- 
graph and the decentralized network as an (N-l)-regular isograph. A third 
isographic network structure is introduced below. 

3. An information network N is strictly heirarchical if the 

311 2 ' CyleS 10 ^ - -Erected L g l i 9 

In general any information network can be characterized as isographic 
or non-isographic depending on the message transfer structure (m.t.s.) G. 



2 The notation idCv^) and od( Vl ) refer to the inward and outward degrees 
of node Vi respectively, l.e., the number of edges of inward and outward 
incidence respectively. “ D 
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Figure 3* An Arbitrary Message Transfer Structure 
-3* Represented by a Digraph 
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The m.t.s. for cyclic and decentralized information networks with 
6 information centers are shown in Figure 4. A strictly hierarchical net- 
work m.t.s, is pictured in Figure 5(a). The resulting undirected tree 
for the m.t.s., shown in 5(b), is obtained by replacing each directed two- 
cycle by a single undirected edge. Note that Figure 3 shows an isographic 
network that corresponds to none of the three above. 

Various other m.t.s. can be proposed. Of particular interest is the 
2-regular isographic network shown in Figure 6. This m.t.s. is found in 
the ARPA network [9] * 

Structural Measures 

Having defined the basic structures, we recognize that any particular 
information network may offer a message transfer structure that agrees 
exactly with none of the above. The benefits from defining these basic 
structures are: (1) precise definitions assure no ambiguity in the eon- 

cepts , (2) advantages of one structure over another in a particular situation 

may be revealed, and (3) any network structure may be discussed by comparison 
with the basic ones. The second claim is explored in a following paper; 
while the third requires the development of measures comparing any particular 
structure to the basic forms* 

Theorem 1 , For a network with N nodes, the minimum Cq^) and maximum (q'*') 
number of edges are given by 
q° = N 
q 1 - N(N-l) 



and 
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(a) Cyclic Network for Six Information Centers 




(b) Decentralized Network for Six Information Centers 



Figure 4. Graphs of the Cyclic and Decentralized Networks 
for Six Information Centers 
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(a) Strictly Hierarchical Structure 




(b) Replacement of All 2-Gycles in G by an Undirected 
Edge Produces an Undirected Tree 



Figure 5* Illustration of the Message Transfer Structures in 
Strictly Hierarchical Networks 
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figure 6. 



The 2-Degree Isographic Network for 
Might Information Centers (ARPA) 
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Proof : (Minimum) By assumption, G is a strongly connected digraph* 

Thus each node must have at least two edges (one inward incident to 
the other) joining it to G, However, by the definition of an infor- 
mation network, access must be provided to each node and each node 
must be able to reach any other node. 

id(v i ) > 1 and od(v, > 1) ¥ i = 1,2,,..,N 

The minimum occurs obviously when 

id(v i ) = od(v ± ) ■ 1 ¥ i = 1,2, . . . S N. 

This requirement can be met with N nodes only if G is a directed cycle 
of N edges , 

(Maximum) To obtain the maximum number of edges, each node should have 
an outward degree of N— 1. With N nodes, the total number of edges is 
N(N-l) . 

Flexibility 

We define 

N N N 

q(G) » £ [id(v.) + od(v.)]/2 — T id(v.) - J od (v.) 

i-1 1 1 i-1 1 i-1 1 

and note that q counts the number of edges in a m.t.s, (G) for an information 

network with N nodes. For the cyclic network q ^ q G ^ q° *= n. 

An obvious consequence of the cyclic network structure is that for 

each node v^eG, where a message is initiated or referred, there exists one, 

and only one, node to which the message may be directly referred. This 

represents the most restrictive structure. The decentralized structure 

obviously is the least restrictive. Since the lack of restriction, or 

conversely the presence of flexibility, in originating or referring messages 

seems intuitively a desirable feature, we label the cyclic network as 
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0-f lexible and the decentralized network as 1-f lexible. 



The measure 




is proposed to indicate the degree of flexibility in a network m,t,s 
with q edges and N nodes. Since 



z(N,q°) - 0 for the cyclic network containing N nodes, and 
z(N s q^) = 1 for the decentralized network. 



we refer to any network as z-f lexible » thus reflecting a proportionate 
degree of flexibility. Several network structures are shown in Figure 7 
to illustrate the measure z(N,q). 

The isographic networks permit a more general computation of flexibility 
Theorem 2 : For a k-regular isographic network with N centers 



Proof : This value results from the fact that there exist exactly kN edges. 

Obviously the flexibility for a k~ degree isographic network is bounded 
by the flexibility value for the k— regular isographic network, i t e. , for 
any k-degree isographic network with N centers 



For a general network (not necessarily isographic), we define the 
degree 6 by 



A second characteristic is that for a fixed degree (6), the network with 
fewer nodes has greater flexibility. 



z(N,q) = k-1 
N-2 



z(N,q) < k—1 
N— 2 



6 = max 

i=l ,2 



N 



{id Cv ± ) , od(v ± )} 
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Figure 7, Network Structures Illustrating Flexibility Measure z(S 
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One recognizes the difficulty in calculating the measure of flexibility 
z(N,q) for as N becomes large and many edges are present, computing the value 
of q by examining the graph becomes quite difficulty Fortunately , the value 
of q may be determined simply using the adjacency matrix representation of 
the graph. More on the adjacency matrix is given in Section IV, 
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XV, USING THE GRAPH-THEORETIC CONCEPTS 

The diagrammatic, representation of graphs proves useful in establishing 
basic definitions and developing an intuitive appreciation of contrasting 
information transfer structures. To explore characteristics of individual 
structures, however, we employ the matrix representation of a graph. 

The adjacency matrix 

Consider an arbitrary assignment of the integers to the N 

nodes of the graph G depicting the m.t.s, of a information network. We 

use the integers as subscripts and identify the nodes by v. s v_, ,v to 

X 2 n 

maintain consistency with our previous notation. Let C be an NxN matrix 
with values c. , where 



Attaching to C the superscripts c and d again to indicate cyclic and 
decentralized d respectively, we picture a general matrix form for each of 
these structures in Figure 8 along with the corresponding graph. (In 
general, different numberings of the nodes yield different matrix repre- 
sentations.) Note that no node is shown to send messages to ItSeXfi there- 
fore, in every matrix the main diagonal consists only of zeros. Also note 
that the matrix representation of G provides a computationally effective 
means of identifying the decentralized and cyclic structures as substructures 
in larger, more complicated networks. 

Let us examine the matrix representation of G for a strictly hierarchical 
network shown in Figure 9(a). The matrix is symmetric, ahd treating only 
the upper triangular portion, we obtain the block structure illustrated in 9(b). 




The matrix C is termed the 'of the labeled graph G. 
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Figure 8. liatrix Representations of Two Basic Graph Structures 
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(b) Matrix reprsstntaEion shows block foritt 
of strictly hierarchical structure 



Figure 9, Information Access Structures in a Strictly 
Hierarchical Network 
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Accessibility 

An interesting and useful property of the adjacency matrix is noted 
in the theorem [8, p. 112] stated below: 

Theorem. Let C be the adjacency matrix of a digraph H« The f,j 

entry in C n specifies the number of walks in H of length 

n from v, to v . 

x J 

In more familiar terms, if we let cj be the i,j entry in C n , theh the 
value of represents the number of distinct message transfers involving 

n referrals that conclude with the message originating at and terminating 
at Vj . The illustration in Figure 10 can be followed ebslly* From the 
original adjacency matrix, C 2 is calculated. The element *> 3 specifies 

that a message originated at v^, referred twice (including the original 
referral from v.^, and terminated at v can follow three distinct 
referral paths, i^e^, v^v^, v^v^, or v^v^, Similarly, all other 
nonzero entries have the value 1, and a message originated at V^ and 
terminated at (likewise for v^ and v^) must requite an Odd numbef of 
referrals. By similar examination of C 3 , C 4 , ... , we can conclude that 



c 1 / . - 0 and c^\ = 0 

n ”jk 



1 = 1 , 2 , . . . 
1 < j < k 
n = 2h-l 



and 



'lj 






= 0 



1 < j 

n = 2h 



1 , 2 ,. 



A node v.j e G is said to be accessible by ^ e G If there artists at 
least one directed path from v± to , Considering the adjacency rahttfik 
G this is equivalent to stating 

V is accessible by v. if and only if there exists n (a positive 
integer) n ; C n has the component c 1 } > 0. 
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(a) The graphical representation of 
the Information access structure 
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(b) Matrix representation 
of the graph 0 



■3 0 0 0 \ 

f 

: 0 1 l l 



0 

\ 



0 



1 

1 



I 1 
i i 



^0 3 3 3 

3 0 0 0 

3 0 0 0 

3 0 0 0 



(g) Number of message referral paths 
v.j. to Vj involving 2 referrals 



<d> Number of referral paths 
involving 3 referrals 



Figure 10. Powers of the Adjacency Matrix Reveal Characteristics 
of Referrals and Informatlou Transfer 
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By definition, every node Vj e G is accessible by every node v { E G if 
G represents the m.t.s. for an information network* An information net- 
work is defined to be p-accessible where p indicates the minimum number 
of referrals necessary to enable complete accessibility, i, et t every node 

has access to every other node. 

n 

p " min n : S (n) = J c J has 

1=1 

elements S ±j > 0 V i # j = 1,2,,.,,N t 
Complete accessibility for any decentralized information network is 
accomplished by a single referral; however, the cyclic network with N 

centers is N-accessible. 
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V. SUMMARY 



Working from a verbal definition of information network, « offer 
mathematical definition in ter- of: ( 1 , UB era, (2) information reaourcea. 

(3) information centers, and (4) the total information transfer Structure 
linking ( 1 ), (2 ) ,„ d (3). We concentrate on the message transfer structure, 
as distinguished from the document transfer structure, td identify some 

basic information networks A mnrn „ . , 

tworRs. A more comprehensive classification should 

include the document transfer structure as well, 

J 

Based on graph theory concepts, any message transfer structure can be 
classified as isographic or nonisographic. Among the iaographic structures, 
characteristics of the cyclic and decentralised networks are encored, and 
S flexibility measure for a general network Is defined using these basic 
forms. The adjacency matrix representation provides a means of assessing 
the accessibility of the Information centers In the network. The property 

of complete accessibility is shown to be markedly different for the cyclic 
and decentralized structures. 

Future research is focused on extending the classification to 
include both document and message transfer structures. Methods of evaluation 
considering the information transfer structure end the information resource, 
resident at each center ere presently being investigated. The Concepts 
developed here are being tested in applications to the public library access 
network CHAN) and to existing university and medical library networks. 

ARPA computer network offers yet another promising area of application. 
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